418 Rec'd PCT/PTO 13 JAN 2000



Form PTO-1390 U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE

TRANSMITTAL LETTER TO THE UNITED STATES
DESIGNATED/ELECTED OFFICE (DO/EO/US)
CONCERNING A FILING UNDER 35 U.S.C. 371


ATTORNEY'S DOCKET NUMBER 

GC382-US 

U.S. APPLICATION NO.

[please provide] 


INTERNATIONAL APPLICATION NO. INTERNATIONAL FILING DATE 

PCT/US98/14647 14 July 1998 


PRIORITY DATE CLAIMED 

15 July 1997 


TITLE OF THE INVENTION 

Proteases from Gram-Positive Organisms 



APPLICANT(S) FOR DO/EO/US



David A. ESTELL, US Citizen, residing in San Mateo, California 94403, USA 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other
information:



1. ☒ This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. □ This is an express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than
delay examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and
39(1).

4. ☒ A proper Demand for International Preliminary Examination was made by the 19th month from the earliest
claimed priority date.

5. ☒ A copy of the International Application as filed (35 U.S.C. 371(c)(2))

a. ☒ is transmitted herewith (required only if not transmitted by the International Bureau).

b. □ has been transmitted by the International Bureau.

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US).

6. □ A translation of the International Application into English (35 U.S.C. 371(c)(2)).

7. □ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))

a. □ are transmitted herewith (required only if not transmitted by the International Bureau).

b. □ have been transmitted by the International Bureau.

c. □ have not been made; however, the time limit for making such amendments has NOT expired.

d. □ have not been made and will not be made.

8. □ A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. ☒ An oath or declaration of the inventor/s (35 U.S.C. 371(c)(4)).

10. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C.
371(c)(5)).

Items 11 to 16 below concern document/s or information included:

11. □ An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

12. ☒ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is
included.

13. □ A FIRST preliminary amendment.

□ A SECOND or SUBSEQUENT preliminary amendment.

14. □ A substitute specification.

15. □ A change of power of attorney and/or address letter.

16. ☒ Other items or information:

■ Copy of IPER

■ Copy of Publication of the International Search Report





17. ☒ The following fees are submitted:


CALCULATIONS PTO USE ONLY 


BASIC NATIONAL FEE (37 CFR 1.492(a)(1)-(5)):

Search Report has been prepared by the EPO or JPO $ 840

International preliminary examination fee paid to USPTO (37 CFR 1.482) $ 96

No international preliminary examination fee paid to USPTO (37 CFR 1.482)
but international search fee paid to USPTO (37 CFR 1.445(a)(2)) $

Neither international preliminary examination fee (37 CFR 1.482) nor
international search fee (37 CFR 1.445(a)(2)) paid to USPTO $

International preliminary examination fee paid to USPTO (37 CFR 1.482)
and all claims satisfied provisions of PCT Article 33(2)-(4) $

ENTER APPROPRIATE BASIC FEE AMOUNT = 




$ 936 




Surcharge of $130 for furnishing the oath or declaration later than □ 20 □ 30
months from the earliest claimed priority date (37 CFR 1.492(e)).          $

CLAIMS                NUMBER FILED    NUMBER EXTRA    RATE         AMOUNT
Total Claims          16 - 20 =       0               x $18.00     $
Independent claims     5 -  3 =       2               x $78.00     $ 156
MULTIPLE DEPENDENT CLAIM(S) (IF APPLICABLE)           + $250.00    $

TOTAL OF ABOVE CALCULATIONS =                                      $1092

Reduction of 1/2 for filing by small entity, if applicable. Verified Small Entity
Statement must also be filed (Note 37 CFR 1.9, 1.27, 1.28)

SUBTOTAL =                                                         $1092

Processing fee of $130 for furnishing the English translation later than □ 20 □ 30
months from the earliest claimed priority date (37 CFR 1.492(f)). +        $

TOTAL NATIONAL FEE =                                               $

Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must
be accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per
property                                                           $ 40

TOTAL FEES ENCLOSED =                                              $1132

Amount to be refunded:                                             $
Amount to be charged:                                              $


a. □ A check in the amount of $ to cover the above fees is enclosed.

b. ☒ Please charge my Deposit Account No. 07-1048 in the amount of $1132 to cover the above fees.
A duplicate copy of this sheet is enclosed.

c. ☒ The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any
overpayment to Deposit Account No. 07-1048. A duplicate copy of this sheet is enclosed.

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 
CFR 1.137(a) or (b)) must be filed and granted to restore the application to pending status. 

GENENCOR INTERNATIONAL, INC.
925 PAGE MILL ROAD
PALO ALTO, CALIFORNIA 94304-1013

Jeffery D FRAZIER
(printed name)

Registration Number: 34,601

Tel. 650-846-2655
Fax 650-845-6504






WO 99/03984 



09/462845 




PROTEASES FROM GRAM-POSITIVE ORGANISMS



FIELD OF THE INVENTION 



The present invention relates to serine proteases derived from gram-positive 
microorganisms. The present invention provides nucleic acid and amino acid sequences of 
serine protease 1, 2, 3, 4 and 5 identified in Bacillus. The present invention also provides 
methods for the production of serine protease 1, 2, 3, 4 and 5 in host cells as well as the
production of heterologous proteins in a host cell having a mutation or deletion of part or all 
of at least one of the serine proteases of the present invention. 



BACKGROUND OF THE INVENTION

Gram-positive microorganisms, such as members of the group Bacillus, have been
used for large-scale industrial fermentation due, in part, to their ability to secrete their
fermentation products into the culture media. In gram-positive bacteria, secreted proteins
are exported across a cell membrane and a cell wall, and are subsequently released
into the external media, usually maintaining their native conformation.

Various gram-positive microorganisms are known to secrete extracellular and/or
intracellular protease at some stage in their life cycles. Many proteases are produced in
large quantities for industrial purposes. A negative aspect of the presence of proteases in
gram-positive organisms is their contribution to the overall degradation of secreted
heterologous or foreign proteins.

The classification of proteases found in microorganisms is based on their catalytic
mechanism, which results in four groups: the serine proteases; metalloproteases; cysteine
proteases; and aspartic proteases. These categories can be distinguished by their
sensitivity to various inhibitors. For example, the serine proteases are inhibited by
phenylmethylsulfonylfluoride (PMSF) and diisopropylfluorophosphate (DIFP); the
metalloproteases by chelating agents; the cysteine enzymes by iodoacetamide and heavy
metals; and the aspartic proteases by pepstatin. The serine proteases have alkaline pH
optima, the metalloproteases are optimally active around neutrality, and the cysteine and
aspartic enzymes have acidic pH optima (Biotechnology Handbooks, Bacillus, vol. 2, edited
by Harwood, 1989, Plenum Press, New York).

Proteolytic enzymes that are dependent upon a serine residue for catalytic activity
are called serine proteases. As described in Methods in Enzymology, vol. 244, Academic
Press, Inc. 1994, page 21, serine proteases of the family S9 have the catalytic residue triad
"Ser-Asp-His" with conservation of amino acids around them.

SUMMARY OF THE INVENTION

The present invention relates to the unexpected discovery of five heretofore 
unknown or unrecognized S9 type serine proteases found in uncharacterized translated 
genomic nucleic acid sequences of Bacillus subtilis, designated herein as SP1, SP2, SP3, 
SP4 and SP5 having the nucleic acid and amino acid sequences shown in the Figures. The present
invention is based, in part, upon the presence of the amino acid triad S-D-H in the five serine
proteases, as well as amino acid conservation around the triad. The present invention is
also based in part upon the heretofore uncharacterized or unrecognized overall amino acid
relatedness that SP1, SP2, SP3, SP4 and SP5 have with the serine protease
dipeptidyl-aminopeptidase B from yeast (DAP) and with each other.

The present invention provides isolated polynucleotide and amino acid sequences 
for SP1, SP2, SP3, SP4 and SP5. Due to the degeneracy of the genetic code, the present 
invention encompasses any nucleic acid sequence that encodes the SP1, SP2, SP3, SP4 
and SP5 deduced amino acid sequences shown in Figures 2A-2B through Figure 6, respectively.

The present invention encompasses amino acid variations of B. subtilis SP1, SP2, 
SP3, SP4 and SP5 disclosed herein that have proteolytic activity. B. subtilis SP1, SP2, 
SP3, SP4 and SP5, as well as proteolytically active amino acid variations thereof, have 
application in cleaning compositions. In one aspect of the present invention, SP1, SP2, 
SP3, SP4 and SP5 obtainable from a gram-positive microorganism are produced on an 
industrial fermentation scale in a microbial host expression system. In another aspect, 
isolated and purified SP1, SP2, SP3, SP4 or SP5 obtainable from a gram-positive 
microorganism is used in compositions of matter intended for cleaning purposes, such as 
detergents. Accordingly, the present invention provides a cleaning composition comprising 
at least one of SP1, SP2, SP3, SP4 and SP5 obtainable from a gram-positive 
microorganism. The serine protease may be used alone in the cleaning composition or in 
combination with other enzymes and/or mediators or enhancers. 

The production of desired heterologous proteins or polypeptides in gram-positive 
microorganisms may be hindered by the presence of one or more proteases which degrade 
the produced heterologous protein or polypeptide. Therefore, the present invention also 
encompasses a gram-positive microorganism having a mutation or deletion of part or all of the
gene encoding SP1, SP2, SP3, SP4 and/or SP5, which results in the inactivation of their 
proteolytic activity, either alone or in combination with deletions or mutations in other 
proteases, such as apr, npr, epr, mpr for example, or other proteases known to those of skill 
in the art. In one embodiment of the present invention, the gram-positive organism is a 
member of the genus Bacillus. In another embodiment, the Bacillus is Bacillus subtilis. 

In another aspect, the gram-positive microorganism host having one or more 
deletions or mutations in a serine protease of the present invention is further genetically
engineered to produce a desired protein. In one embodiment of the present invention, the
desired protein is heterologous to the gram-positive host cell. In another embodiment, the
desired protein is homologous to the host cell. The present invention encompasses a
gram-positive host cell having a deletion or interruption of the naturally occurring nucleic acid
encoding the homologous protein, such as a protease, and having nucleic acid encoding 
the homologous protein or a variant thereof re-introduced in a recombinant form. In another 
embodiment, the host cell produces the homologous protein. Accordingly, the present 
invention also provides methods and expression systems for reducing degradation of 
heterologous or homologous proteins produced in gram-positive microorganisms comprising 
the steps of obtaining a Bacillus host cell comprising nucleic acid encoding said 
heterologous protein wherein said host cell contains a mutation or deletion in at least one of 
the genes encoding SP1, SP2, SP3, SP4 and SP5; and growing said Bacillus host cell 
under conditions suitable for the expression of said heterologous protein. The gram-positive 
microorganism may be normally sporulating or non-sporulating. 

The present invention provides a method for detecting gram-positive microorganism
homologs of B. subtilis SP1, SP2, SP3, SP4 and SP5 that comprises hybridizing part or all
of the nucleic acid encoding B. subtilis SP1, SP2, SP3, SP4 and SP5 with nucleic acid
derived from gram-positive organisms, either of genomic or cDNA origin.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-1C show the DNA (SEQ ID NO:1) and deduced amino acid sequence (SEQ
ID NO:2) for SP1 (YUXL).

Figures 2A-2B show an amino acid alignment between DAP (dap2_yeast) (SEQ ID NO:3)
and SP1 (YUXL). For Figures 2A-2B, 3 and 4, the amino acid triad S-D-H is indicated.

Figure 3 shows an amino acid alignment between SP1 (YUXL) (SEQ ID NO:2) and SP2
(YTMA) (SEQ ID NO:5).

Figure 4 shows an amino acid alignment between SP1 (YUXL) (SEQ ID NO:2) and SP3
(YITV) (SEQ ID NO:7).

Figure 5 shows an amino acid alignment between SP1 (YUXL) (SEQ ID NO:2) and SP4
(YQKD) (SEQ ID NO:9).

Figure 6 shows an amino acid alignment between SP1 (YUXL) (SEQ ID NO:2) and SP5
(CAH) (SEQ ID NO:10).

Figures 7A-7B show the DNA (SEQ ID NO:4) and deduced amino acid sequence for SP2
(YTMA) (SEQ ID NO:5).

Figures 8A-8B show the DNA (SEQ ID NO:6) and deduced amino acid sequence for SP3
(YITV) (SEQ ID NO:7).

Figures 9A-9B show the DNA (SEQ ID NO:8) and deduced amino acid sequence for SP4
(YQKD) (SEQ ID NO:9).

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Definitions - As used herein, the genus Bacillus includes all members known to those of
skill in the art, including but not limited to B. subtilis, B. licheniformis, B. lentus, B. brevis,
B. stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans,
B. lautus and B. thuringiensis.



The present invention encompasses novel SP1, SP2, SP3, SP4 and SP5 from
gram-positive organisms. In a preferred embodiment, the gram-positive organism is a Bacillus.
In another preferred embodiment, the gram-positive organism is Bacillus subtilis. As used
herein, B. subtilis SP1 (YuxL) refers to the DNA and deduced amino acid sequence shown
in Figures 1A-1C and Figures 2A-2B; SP2 (YtmA) refers to the DNA and deduced amino 
acid sequence shown in Figures 7A-7B and Figure 3; SP3 (YitV) refers to the DNA and 
deduced amino acid sequence shown in Figures 8A-8B and Figure 4; SP4 (YqkD) refers to 
the DNA and deduced amino acid sequence shown in Figures 9A-9B and Figure 5; and SP5 
(CAH) refers to the deduced amino acid sequence shown in Figure 6. The present 
invention encompasses amino acid variations of the B. subtilis amino acid sequences of 
SP1, SP2, SP3, SP4 and SP5 that have proteolytic activity. Such proteolytic amino acid 
variants can be used in cleaning compositions. 

As used herein, "nucleic acid" refers to a nucleotide or polynucleotide sequence, and 
fragments or portions thereof, and to DNA or RNA of genomic or synthetic origin which may 
be double-stranded or single-stranded, whether representing the sense or antisense strand. 
As used herein "amino acid" refers to peptide or protein sequences or portions thereof. A 
"polynucleotide homolog" as used herein refers to a novel gram-positive microorganism 
polynucleotide that has at least 80%, at least 90% or at least 95% identity to B. subtilis
SP1, SP2, SP3, SP4 or SP5, or which is capable of hybridizing to B. subtilis SP1, SP2, SP3,
SP4 or SP5 under conditions of high stringency and which encodes an amino acid 
sequence having serine protease activity. 
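
The identity thresholds recited above can be illustrated with a minimal Python sketch. The text does not state how percent identity is to be computed, so the convention used here (identical residues divided by aligned columns, ignoring columns that are gaps in both sequences) is an assumption, and the function names `percent_identity` and `identity_tier` are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: identity = identical residues / aligned columns,
    skipping columns where both sequences carry a gap ('-').
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_tier(pct):
    """Return the highest recited threshold (95, 90 or 80) that pct meets, else None."""
    for threshold in (95, 90, 80):
        if pct >= threshold:
            return threshold
    return None
```

Under this convention a candidate that matches a reference at 9 of 10 aligned positions meets the 90% threshold but not the 95% threshold. Identity alone does not make a sequence a homolog as defined above, since the encoded amino acid sequence must also have serine protease activity.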

The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino acid 
that is removed from at least one component with which it is naturally associated. 

As used herein, the term "heterologous protein" refers to a protein or polypeptide 
that does not naturally occur in a gram-positive host cell. Examples of heterologous 
proteins include enzymes such as hydrolases including proteases, cellulases, amylases, 
carbohydrases, and lipases; isomerases such as racemases, epimerases, tautomerases, or 
mutases; transferases, kinases and phosphatases. The heterologous gene may encode
therapeutically significant proteins or peptides, such as growth factors, cytokines, ligands, 
receptors and inhibitors, as well as vaccines and antibodies. The gene may encode 
commercially important industrial proteins or peptides, such as proteases, carbohydrases 
such as amylases and glucoamylases, cellulases, oxidases and lipases. The gene of 
interest may be a naturally occurring gene, a mutated gene or a synthetic gene. 

The term "homologous protein" refers to a protein or polypeptide native or naturally 
occurring in a gram-positive host cell. The invention includes host cells producing the 
homologous protein via recombinant DNA technology. The present invention encompasses
a gram-positive host cell having a deletion or interruption of the nucleic acid encoding the
naturally occurring homologous protein, such as a protease, and having nucleic acid 
encoding the homologous protein, or a variant thereof re-introduced in a recombinant form. 
In another embodiment, the host cell produces the homologous protein. 

As used herein, the term "overexpressing" when referring to the production of a 
protein in a host cell means that the protein is produced in greater amounts than its 
production in its naturally occurring environment. 

As used herein, the phrase "proteolytic activity" refers to the ability of a protein to
hydrolyze a peptide bond. Enzymes having proteolytic activity are described in Enzyme
Nomenclature, 1992, edited by Webb, Academic Press, Inc.

Detailed Description of the Preferred Embodiments 

The unexpected discovery of the serine proteases SP1, SP2, SP3, SP4 and SP5 in 
B.subtilis provides a basis for producing host cells, expression methods and systems which 
can be used to prevent the degradation of recombinantly produced heterologous proteins. 
In a preferred embodiment, the host cell is a gram-positive host cell that has a deletion or
mutation in the naturally occurring serine protease gene, said mutation resulting in the complete
deletion or inactivation of the production by the host cell of the proteolytic serine protease
gene product. In another embodiment of the present invention, the host cell is additionally
genetically engineered to produce a desired protein or polypeptide.

It may also be desired to genetically engineer host cells of any type to produce a 
gram-positive serine protease SP1, SP2, SP3, SP4 or SP5. Such host cells are used in
large scale fermentation to produce large quantities of the serine protease which may be 
isolated or purified and used in cleaning products, such as detergents. 

I. Serine Protease Nucleic Acid and Amino Acid Sequences 

The SP1, SP2, SP3 and SP4 polynucleotides having the sequences as shown in the 
Figures encode the Bacillus subtilis serine proteases SP1, SP2, SP3, and SP4. As will be understood
by the skilled artisan, due to the degeneracy of the genetic code, a variety of
polynucleotides can encode the Bacillus SP1, SP2, SP3, SP4 and SP5. The present
invention encompasses all such polynucleotides. 
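
The degeneracy referred to above can be made concrete with a short Python sketch that counts how many distinct coding sequences encode a given peptide under the standard genetic code. The three-residue peptide "SDH" is used only because it echoes the triad letters; it is not a sequence from the Figures, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon, with codons enumerated
# in TCAG order for each of the three positions ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(amino_acid):
    """All codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def count_encodings(peptide):
    """Number of distinct nucleotide sequences encoding the peptide."""
    total = 1
    for amino_acid in peptide:
        total *= len(synonymous_codons(amino_acid))
    return total
```

Serine has six codons and aspartic acid and histidine have two each, so even this tripeptide has 6 x 2 x 2 = 24 possible coding sequences. The count grows multiplicatively with protein length.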

The present invention encompasses novel SP1, SP2, SP3, SP4 and SP5 
polynucleotide homologs encoding gram-positive microorganism serine proteases SP1, 
SP2, SP3, SP4 and SP5, respectively, which have at least 80%, or at least 90% or at least 
95% identity to B.subtilis as long as the homolog encodes a protein that has proteolytic 
activity.

Gram-positive polynucleotide homologs of B. subtilis SP1, SP2, SP3, SP4 or SP5
may be obtained by standard procedures known in the art from, for example, cloned DNA
(e.g., a DNA "library"), genomic DNA libraries, by chemical synthesis once identified, by
cDNA cloning, or by the cloning of genomic DNA, or fragments thereof, purified from a
desired cell. (See, for example, Sambrook et al., 1989, Molecular Cloning, A Laboratory
Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York;
Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, IRL Press, Ltd., Oxford, U.K.,
Vol. I, II.) A preferred source is from genomic DNA. Nucleic acid sequences derived from
genomic DNA may contain regulatory regions in addition to coding regions. Whatever the 
source, the isolated serine protease gene should be molecularly cloned into a suitable 
vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA fragments are generated, 
some of which will encode the desired gene. The DNA may be cleaved at specific sites using 
various restriction enzymes. Alternatively, one may use DNAse in the presence of manganese 
to fragment the DNA, or the DNA can be physically sheared, as for example, by sonication. 
The linear DNA fragments can then be separated according to size by standard techniques, 
including but not limited to, agarose and polyacrylamide gel electrophoresis and column 
chromatography. 
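
The fragmentation and size-separation steps described above can be sketched in Python as follows. This is an in-silico illustration only: the EcoRI recognition site (GAATTC, cut after the first G) is used as an example enzyme, the input is assumed to be linear DNA, only one strand is modeled, and the function names and any test sequence are hypothetical.

```python
def digest(sequence, site="GAATTC", cut_offset=1):
    """Cut a linear DNA string at every occurrence of a recognition site.

    cut_offset is the number of site bases left on the upstream fragment
    (1 for EcoRI, which cuts G^AATTC on the strand shown).
    """
    fragments = []
    start = 0
    position = sequence.find(site)
    while position != -1:
        cut = position + cut_offset
        fragments.append(sequence[start:cut])
        start = cut
        position = sequence.find(site, position + 1)
    fragments.append(sequence[start:])
    return fragments


def separate_by_size(fragments):
    """Order fragments from longest to shortest, as a gel would resolve them."""
    return sorted(fragments, key=len, reverse=True)
```

Joining the fragments returned by `digest` in order reproduces the input sequence, which is a convenient sanity check that no bases were lost at the cut sites.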

Once the DNA fragments are generated, identification of the specific DNA fragment 
containing the SP1, SP2, SP3, SP4 or SP5 gene may be accomplished in a number of ways. For
example, a B. subtilis SP1, SP2, SP3, SP4 or SP5 gene of the present invention or its
specific RNA, or a fragment thereof, such as a probe or primer, may be isolated and labeled
and then used in hybridization assays to detect a gram-positive SP1, SP2, SP3, SP4 or SP5
gene. (Benton, W. and Davis, R., 1977, Science 196:180; Grunstein, M. and Hogness, D.,
1975, Proc. Natl. Acad. Sci. USA 72:3961). Those DNA fragments sharing substantial
sequence similarity to the probe will hybridize under stringent conditions.

Accordingly, the present invention provides a method for the detection of gram- 
positive SP1, SP2, SP3, SP4 or SP5 polynucleotide homologs which comprises hybridizing 
part or all of a nucleic acid sequence of B. subtilis SP1, SP2, SP3, SP4 or SP5 with gram-
positive microorganism nucleic acid of either genomic or cDNA origin. 

Also included within the scope of the present invention are gram-positive 
microorganism polynucleotide sequences that are capable of hybridizing to the nucleotide 
sequence of B.subtilis SP1, SP2, SP3, SP4 or SP5 under conditions of intermediate to 
maximal stringency. Hybridization conditions are based on the melting temperature (Tm) of 
the nucleic acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular
Cloning Techniques, Methods in Enzymology, Vol. 152, Academic Press, San Diego CA)
incorporated herein by reference, and confer a defined "stringency" as explained below. 




"Maximum stringency" typically occurs at about Tm-5°C (5°C below the Tm of the 
probe); "high stringency" at about 5°C to 10°C below Tm; "intermediate stringency" at about 
10°C to 20°C below Tm; and "low stringency" at about 20°C to 25°C below Tm. As will be 
understood by those of skill in the art, a maximum stringency hybridization can be used to 
identify or detect identical polynucleotide sequences while an intermediate or low stringency 
hybridization can be used to identify or detect polynucleotide sequence homologs. 
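
The stringency definitions above amount to temperature windows measured downward from the Tm of the probe. A minimal Python sketch of that arithmetic follows; the offsets are taken from the text, while the function name `hybridization_window` and any Tm value passed to it are illustrative assumptions.

```python
# Degrees Celsius below Tm, as (smallest offset, largest offset).
STRINGENCY_OFFSETS_C = {
    "maximum": (5, 5),
    "high": (5, 10),
    "intermediate": (10, 20),
    "low": (20, 25),
}


def hybridization_window(tm_celsius, level):
    """Return (lowest, highest) hybridization temperature for a stringency level."""
    smallest, largest = STRINGENCY_OFFSETS_C[level]
    return (tm_celsius - largest, tm_celsius - smallest)
```

For a probe with an assumed Tm of 70°C, high stringency corresponds to roughly 60°C to 65°C and low stringency to roughly 45°C to 50°C.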

The term "hybridization" as used herein shall include "the process by which a strand 
of nucleic acid joins with a complementary strand through base pairing" (Coombs J (1994) 
Dictionary of Biotechnology, Stockton Press, New York NY).

The process of amplification as carried out in polymerase chain reaction (PCR) 
technologies is described in Dieffenbach CW and GS Dveksler (1995, PCR Primer, a
Laboratory Manual, Cold Spring Harbor Press, Plainview NY). A nucleic acid sequence of
at least about 10 nucleotides and as many as about 60 nucleotides from B. subtilis SP1,
SP2, SP3, SP4 or SP5, preferably about 12 to 30 nucleotides, and more preferably about
20-25 nucleotides can be used as a probe or PCR primer. 
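
The length ranges above can be illustrated by enumerating candidate probe or primer windows along a sequence. The Python sketch below only applies the length limits stated in the text (defaulting to the more preferred 20-25 nucleotide range); it makes no attempt to score melting temperature or specificity, and the function name is hypothetical.

```python
def candidate_probes(sequence, min_len=20, max_len=25):
    """Yield (start, subsequence) for every window within the length range.

    The text permits about 10 to 60 nucleotides overall; other ranges can be
    selected through min_len and max_len.
    """
    if not 10 <= min_len <= max_len <= 60:
        raise ValueError("lengths must lie within the 10 to 60 nucleotide range")
    for length in range(min_len, max_len + 1):
        for start in range(len(sequence) - length + 1):
            yield start, sequence[start:start + length]
```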

The B. subtilis amino acid sequences SP1, SP2, SP3, SP4 and SP5 (shown in
Figures 2A-2B through Figure 6) were identified via a FASTA search of Bacillus subtilis
genomic nucleic acid sequences. B. subtilis SP1 (YuxL) was identified by its structural
homology to the serine protease DAP, classified as an S9 type serine protease and designated
in Figures 2A-2B as "dap2_yeast". As shown in Figures 2A-2B, SP1 has the amino acid
triad "S-D-H" indicated. Conservation of amino acids around each residue is noted in
Figures 2A-2B through Figure 6. SP2 (YtmA), SP3 (YitV), SP4 (YqkD) and SP5 (CAH) were
identified by their structural and overall amino acid homology to SP1 (YuxL). SP1 and
SP4 were described in Parsot and Kebayashi, respectively, but were not characterized as
serine proteases or serine proteases of the S9 family.

II. Expression Systems

The present invention provides host cells, expression methods and systems for the 
enhanced production and secretion of desired heterologous or homologous proteins in 
gram-positive microorganisms. In one embodiment, a host cell is genetically engineered to
have a deletion or mutation in the gene encoding a gram-positive SP1, SP2, SP3, SP4 or 
SP5 such that the respective activity is deleted. In an alternative embodiment of the 
present invention, a gram-positive microorganism is genetically engineered to produce a 
serine protease of the present invention.

Inactivation of a gram-positive serine protease in a host cell

Producing an expression host cell incapable of producing the naturally occurring 
serine protease necessitates the replacement and/or inactivation of the naturally occurring 
gene from the genome of the host cell. In a preferred embodiment, the mutation is a non- 
reverting mutation. 

One method for mutating nucleic acid encoding a gram-positive serine protease is to 
clone the nucleic acid or part thereof, modify the nucleic acid by site directed mutagenesis 
and reintroduce the mutated nucleic acid into the cell on a plasmid. By homologous 
recombination, the mutated gene may be introduced into the chromosome. In the parent 
host cell, the result is that the naturally occurring nucleic acid and the mutated nucleic acid 
are located in tandem on the chromosome. After a second recombination, the modified 
sequence is left in the chromosome having thereby effectively introduced the mutation into 
the chromosomal gene for progeny of the parent host cell. 

Another method for inactivating the serine protease proteolytic activity is through 
deleting the chromosomal gene copy. In a preferred embodiment, the entire gene is 
deleted, the deletion occurring in such a way as to make reversion impossible. In another
preferred embodiment, a partial deletion is produced, provided that the nucleic acid
sequence left in the chromosome is too short for homologous recombination with a plasmid
encoded serine protease gene. In another preferred embodiment, nucleic acid encoding
the catalytic amino acid residues is deleted.

Deletion of the naturally occurring gram-positive microorganism serine protease can 
be carried out as follows. A serine protease gene including its 5' and 3' regions is isolated 
and inserted into a cloning vector. The coding region of the serine protease gene is deleted 
from the vector in vitro, leaving behind a sufficient amount of the 5' and 3' flanking
sequences to provide for homologous recombination with the naturally occurring gene in the 
parent host cell. The vector is then transformed into the gram-positive host cell. The vector 
integrates into the chromosome via homologous recombination in the flanking regions. This 
method leads to a gram-positive strain in which the protease gene has been deleted. 
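
The outcome of the deletion procedure just described can be modeled on strings. The Python sketch below builds the deletion construct (5' flank joined directly to 3' flank) and returns the chromosome as it would read after the coding region has been exchanged for that construct. It models only the end result, not the recombination events themselves; the coordinates, flank length and function name are hypothetical.

```python
def delete_coding_region(chromosome, cds_start, cds_end, flank_len):
    """Return (construct, edited_chromosome) for an in-frame-agnostic deletion.

    chromosome : DNA string containing the protease locus
    cds_start  : index of the first base of the coding region
    cds_end    : index one past the last base of the coding region
    flank_len  : number of bases of 5' and 3' flanking sequence retained
    """
    if cds_start < flank_len or cds_end + flank_len > len(chromosome):
        raise ValueError("not enough flanking sequence for recombination")
    five_prime = chromosome[cds_start - flank_len:cds_start]
    three_prime = chromosome[cds_end:cds_end + flank_len]
    construct = five_prime + three_prime  # coding region removed in vitro
    edited = (chromosome[:cds_start - flank_len]
              + construct
              + chromosome[cds_end + flank_len:])
    return construct, edited
```

The edited chromosome is shorter than the original by exactly the length of the coding region, and the two flanks become adjacent, which is what sequencing or restriction mapping of a true deletion strain would be expected to show.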

The vector used in an integration method is preferably a plasmid. A selectable 
marker may be included to allow for ease of identification of desired recombinant 
microorganisms. Additionally, as will be appreciated by one of skill in the art, the vector is
preferably one which can be selectively integrated into the chromosome. This can be 
achieved by introducing an inducible origin of replication, for example, a temperature 
sensitive origin into the plasmid. By growing the transformants at a temperature to which 
the origin of replication is sensitive, the replication function of the plasmid is inactivated, 
thereby providing a means for selection of chromosomal integrants. Integrants may be
selected for growth at high temperatures in the presence of the selectable marker, such as
an antibiotic. Integration mechanisms are described in WO 88/06623. 

Integration by the Campbell-type mechanism can take place in the 5' flanking region
of the protease gene, resulting in a protease positive strain carrying the entire plasmid
vector in the chromosome in the serine protease locus. Since illegitimate recombination will
give different results, it will be necessary to determine whether the complete gene has been
deleted, such as through nucleic acid sequencing or restriction maps.

Another method of inactivating the naturally occurring serine protease gene is to 
mutagenize the chromosomal gene copy by transforming a gram-positive microorganism 
with oligonucleotides which are mutagenic. Alternatively, the chromosomal serine protease 
gene can be replaced with a mutant gene by homologous recombination. 

The present invention encompasses host cells having additional protease deletions 
or mutations, such as deletions or mutations in apr, npr, epr, mpr and others known to those 
of skill in the art. United States Patent 5,264,366 discloses Bacillus host cells having a 
deletion of apr and npr; United States Patent 5,585,253 discloses Bacillus host cells having 
a deletion of epr; Margot et al., 1996, Microbiology 142: 3437-3444 disclose host cells
having a deletion in wpr and EP patent 0369817 discloses Bacillus host cells having a 
deletion of mpr. 

III. Production of Serine Protease

For production of serine protease in a host cell, an expression vector comprising at 
least one copy of nucleic acid encoding a gram-positive microorganism SP1, SP2, SP3, 
SP4 or SP5, and preferably comprising multiple copies, is transformed into the host cell 
under conditions suitable for expression of the serine protease. In accordance with the 
present invention, polynucleotides which encode a gram-positive microorganism SP1, SP2, 
SP3, SP4 or SP5, or fragments thereof, or fusion proteins or polynucleotide homolog 
sequences that encode amino acid variants of B. subtilis SP1, SP2, SP3, SP4 or SP5, may be used
to generate recombinant DNA molecules that direct their expression in host cells. In a
preferred embodiment, the gram-positive host cell belongs to the genus Bacillus. In another
preferred embodiment, the gram-positive host cell is B. subtilis.

As will be understood by those of skill in the art, it may be advantageous to produce 
polynucleotide sequences possessing non-naturally occurring codons. Codons preferred by 
a particular gram-positive host cell (Murray E et al (1989) Nuc Acids Res 17:477-508) can 
be selected, for example, to increase the rate of expression or to produce recombinant RNA 
transcripts having desirable properties, such as a longer half-life, than transcripts produced 
from naturally occurring sequence. 
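
The codon selection described above can be sketched as a synonymous recoding step. This is a minimal illustration, not a method taken from this disclosure: the function names are hypothetical, and the preferred-codon mapping is a placeholder rather than measured codon usage for any gram-positive host. The sketch replaces each codon with a preferred synonym and checks that the encoded amino acid is unchanged.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Placeholder preferences for illustration only (assumption, not real usage data).
HYPOTHETICAL_PREFERRED = {"K": "AAA", "L": "CTG"}


def translate(dna):
    """Translate a DNA string codon by codon; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def recode(dna, preferred):
    """Swap each codon for the preferred synonym of its amino acid, if one is given."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    recoded = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        replacement = preferred.get(amino_acid, codon)
        # Guard against a preference table that would change the protein.
        if CODON_TABLE[replacement] != amino_acid:
            raise ValueError(f"{replacement} does not encode {amino_acid}")
        recoded.append(replacement)
    return "".join(recoded)
```

For example, `recode("ATGAAGTTA", HYPOTHETICAL_PREFERRED)` changes the lysine and leucine codons while `translate` gives the same peptide before and after.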

Altered SP1, SP2, SP3, SP4 or SP5 polynucleotide sequences which may be used 
in accordance with the invention include deletions, insertions or substitutions of different 
nucleotide residues resulting in a polynucleotide that encodes the same or a functionally 
equivalent SP1, SP2, SP3, SP4 or SP5 homolog, respectively. As used herein a "deletion" 
is defined as a change in either nucleotide or amino acid sequence in which one or more 
nucleotides or amino acid residues, respectively, are absent. 

As used herein an "insertion" or "addition" is that change in a nucleotide or amino 
acid sequence which has resulted in the addition of one or more nucleotides or amino acid 
residues, respectively, as compared to the naturally occurring SP1, SP2, SP3, SP4 or SP5. 

As used herein "substitution" results from the replacement of one or more 
nucleotides or amino acids by different nucleotides or amino acids, respectively. 

The encoded protein may also show deletions, insertions or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent SP1, SP2, SP3, 
SP4 or SP5 variant. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues as long as the variant retains the ability to modulate secretion. For 
example, negatively charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; and amino acids with uncharged polar 
head groups having similar hydrophilicity values include leucine, isoleucine, valine; glycine, 
alanine; asparagine, glutamine; serine, threonine, phenylalanine, and tyrosine. 
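
The residue groupings listed above can be written down as a small lookup. The sketch below is illustrative only: the function names are hypothetical, the groups are transcribed from the preceding paragraph in one-letter code, and membership in a group says nothing about whether a given variant actually retains activity.

```python
# Substitution groups transcribed from the paragraph above (one-letter codes).
SIMILARITY_GROUPS = [
    frozenset("DE"),    # negatively charged: aspartic acid, glutamic acid
    frozenset("KR"),    # positively charged: lysine, arginine
    frozenset("LIV"),   # leucine, isoleucine, valine
    frozenset("GA"),    # glycine, alanine
    frozenset("NQ"),    # asparagine, glutamine
    frozenset("STFY"),  # serine, threonine, phenylalanine, tyrosine
]


def is_conservative(original, replacement):
    """True if two different residues fall within the same listed group."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in SIMILARITY_GROUPS)


def nonconservative_changes(reference, variant):
    """List (position, old, new) for substitutions outside the listed groups.

    Positions are 1-based; both sequences must have the same length, so this
    handles substitutions only, not insertions or deletions.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (index + 1, old, new)
        for index, (old, new) in enumerate(zip(reference, variant))
        if old != new and not is_conservative(old, new)
    ]
```

Under these groupings a lysine to arginine change is reported as conservative, while aspartic acid to glycine is not.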

The SP1, SP2, SP3, SP4 or SP5 polynucleotides of the present invention may be 
engineered in order to modify the cloning, processing and/or expression of the gene 
product. For example, mutations may be introduced using techniques which are well known 
in the art, e.g., site-directed mutagenesis to insert new restriction sites, to alter glycosylation 
patterns or to change codon preference. 

In one embodiment of the present invention, a gram-positive microorganism SP1, 
SP2, SP3, SP4 or SP5 polynucleotide may be ligated to a heterologous sequence to 
encode a fusion protein. A fusion protein may also be engineered to contain a cleavage site 
located between the serine protease nucleotide sequence and the heterologous protein 
sequence, so that the serine protease may be cleaved and purified away from the 
heterologous moiety. 

IV. Vector Sequences 

Expression vectors used in expressing the serine proteases of the present invention 
in gram-positive microorganisms comprise at least one promoter associated with a serine 
protease selected from the group consisting of SP1, SP2, SP3, SP4 and SP5, which 
promoter is functional in the host cell. In one embodiment of the present invention, the 
promoter is the wild-type promoter for the selected serine protease and in another 
embodiment of the present invention, the promoter is heterologous to the serine protease, 
but still functional in the host cell. In one preferred embodiment of the present invention, 
nucleic acid encoding the serine protease is stably integrated into the microorganism 
genome. 

In a preferred embodiment, the expression vector contains a multiple cloning site 
cassette which preferably comprises at least one restriction endonuclease site unique to the 
vector, to facilitate ease of nucleic acid manipulation. In a preferred embodiment, the vector 
also comprises one or more selectable markers. As used herein, the term selectable marker 
refers to a gene capable of expression in the gram-positive host which allows for ease of 
selection of those hosts containing the vector. Examples of such selectable markers 
include but are not limited to markers conferring resistance to antibiotics such as 
erythromycin, actinomycin, chloramphenicol and tetracycline. 

V. Transformation 

A variety of host cells can be used for the production of SP1, SP2, SP3, SP4 or SP5 
including bacterial, fungal, mammalian and insect cells. General transformation 
procedures are taught in Current Protocols In Molecular Biology (vol. 1, edited by Ausubel et 
al., John Wiley & Sons, Inc. 1987, Chapter 9) and include calcium phosphate methods, 
transformation using DEAE-Dextran and electroporation. Plant transformation methods are 
taught in Rodriquez (WO 95/14099, published 26 May 1995). 

In a preferred embodiment, the host cell is a gram-positive microorganism and in 
another preferred embodiment, the host cell is Bacillus. In one embodiment of the present 
invention, nucleic acid encoding one or more serine protease(s) of the present invention is 
introduced into a host cell via an expression vector capable of replicating within the host 
cell. Suitable replicating plasmids for Bacillus are described in Molecular Biological Methods 
for Bacillus, Ed. Harwood and Cutting, John Wiley & Sons, 1990, hereby expressly 
incorporated by reference; see chapter 3 on plasmids. Suitable replicating plasmids for B. 
subtilis are listed on page 92. 

In another embodiment, nucleic acid encoding a serine protease(s) of the present 
invention is stably integrated into the microorganism genome. Preferred host cells are 
gram-positive host cells. Another preferred host is Bacillus. Another preferred host is 
Bacillus subtilis. Several strategies have been described in the literature for the direct 
cloning of DNA in Bacillus. Plasmid marker rescue transformation involves the uptake of a 
donor plasmid by competent cells carrying a partially homologous resident plasmid 
(Contente et al., Plasmid 2:555-571 (1979); Haima et al., Mol. Gen. Genet. 223:185-191 
(1990); Weinrauch et al., J. Bacteriol. 154(3):1077-1087 (1983); and Weinrauch et al., J. 
Bacteriol. 169(3):1205-1211 (1987)). The incoming donor plasmid recombines with the 
homologous region of the resident "helper" plasmid in a process that mimics chromosomal 
transformation. 

Transformation by protoplast transformation is described for B. subtilis in Chang and 
Cohen, (1979) Mol. Gen. Genet. 168:111-115; for B. megaterium in Vorobjeva et al., (1980) 
FEMS Microbiol. Letters 7:261-263; for B. amyloliquefaciens in Smith et al., (1986) Appl. 
and Env. Microbiol. 51:634; for B. thuringiensis in Fisher et al., (1981) Arch. Microbiol. 
139:213-217; for B. sphaericus in McDonald (1984) J. Gen. Microbiol. 130:203; and for B. larvae 
in Bakhiet et al., (1985) 49:577. Mann et al. (1986, Current Microbiol. 13:131-135) report 
on transformation of Bacillus protoplasts and Holubova (1985, Folia Microbiol. 30:97) 
discloses methods for introducing DNA into protoplasts using DNA-containing liposomes. 

VI. Identification of Transformants 

Whether a host cell has been transformed with a mutated or a naturally occurring 
gene encoding a gram-positive SP1, SP2, SP3, SP4 or SP5, detection of the 
presence/absence of marker gene expression can suggest whether the gene of interest is 
present. However, its expression should be confirmed. For example, if the nucleic acid 
encoding a serine protease is inserted within a marker gene sequence, recombinant cells 
containing the insert can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with nucleic acid encoding the serine 
protease under the control of a single promoter. Expression of the marker gene in response 
to induction or selection usually indicates expression of the serine protease as well. 

Alternatively, host cells which contain the coding sequence for a serine protease and 
express the protein may be identified by a variety of procedures known to those of skill in 
the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridization and protein bioassay or immunoassay techniques which include membrane- 
based, solution-based, or chip-based technologies for the detection and/or quantification of 
the nucleic acid or protein. 

The presence of the serine protease polynucleotide sequence can be detected by DNA-DNA 
or DNA-RNA hybridization or amplification using probes, portions or fragments of 
B. subtilis SP1, SP2, SP3, SP4 or SP5. 



VII. Assay of Protease Activity 

There are various assays known to those of skill in the art for detecting and 
measuring protease activity. There are assays based upon the release of acid-soluble 
peptides from casein or hemoglobin measured as absorbance at 280 nm or colorimetrically 
using the Folin method (Bergmeyer, et al., 1984, Methods of Enzymatic Analysis vol. 5, 
Peptidases, Proteinases and their Inhibitors, Verlag Chemie, Weinheim). Other assays 
involve the solubilization of chromogenic substrates (Ward, 1983, Proteinases, in Microbial 
Enzymes and Biotechnology (W.M. Fogarty, ed.), Applied Science, London, pp. 251-317). 

VIII. Secretion of Recombinant Proteins 

Means for determining the levels of secretion of a heterologous or homologous 
protein in a gram-positive host cell and detecting secreted proteins include the use of either 
polyclonal or monoclonal antibodies specific for the protein. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated 
cell sorting (FACS). These and other assays are described, among other places, in 
Hampton R et al (1990, Serological Methods, a Laboratory Manual, APS Press, St Paul MN) 
and Maddox DE et al (1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in 
the art and can be used in various nucleic and amino acid assays. Means for producing 
labeled hybridization or PCR probes for detecting specific polynucleotide sequences include 
oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide. 
Alternatively, the nucleotide sequence, or any portion of it, may be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate 
RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway NJ), Promega 
(Madison WI), and US Biochemical Corp (Cleveland OH) supply commercial kits and 
protocols for these procedures. Suitable reporter molecules or labels include those 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as 
substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the use of 
such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149 and 4,366,241. Also, recombinant immunoglobulins may be produced as shown 
in US Patent No. 4,816,567 and incorporated herein by reference. 

IX. Purification of Proteins 

Gram positive host cells transformed with polynucleotide sequences encoding 
heterologous or homologous protein may be cultured under conditions suitable for the 
expression and recovery of the encoded protein from cell culture. The protein produced by 
a recombinant gram-positive host cell comprising a serine protease of the present invention 
will be secreted into the culture media. Other recombinant constructions may join the 
heterologous or homologous polynucleotide sequences to nucleotide sequence encoding a 
polypeptide domain which will facilitate purification of soluble proteins (Kroll DJ et al (1993) 
DNA Cell Biol 12:441-53). 

Such purification facilitating domains include, but are not limited to, metal chelating 
peptides such as histidine-tryptophan modules that allow purification on immobilized metals 
(Porath J (1992) Protein Expr Purif 3:263-281), protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker 
sequence such as Factor Xa or enterokinase (Invitrogen, San Diego CA) between the 
purification domain and the heterologous protein can be used to facilitate purification. 

X. Uses of The Present Invention 

Genetically Engineered Host Cells 

The present invention provides genetically engineered host cells comprising 
preferably non-revertable mutations or deletions in the naturally occurring gene encoding 
one or more of SP1, SP2, SP3, SP4 or SP5 such that the proteolytic activity is diminished or 
deleted altogether. The host cell may contain additional protease deletions, such as 
deletions of the mature subtilisin protease and/or mature neutral protease disclosed in 
United States Patent No. 5,264,366. 

In a preferred embodiment, the host cell is genetically engineered to produce a 
desired protein or polypeptide. In a preferred embodiment the host cell is a Bacillus. In 
another preferred embodiment, the host cell is a Bacillus subtilis. 

In an alternative embodiment, a host cell is genetically engineered to produce a 
gram-positive SP1, SP2, SP3, SP4 or SP5. In a preferred embodiment, the host cell is 
grown under large scale fermentation conditions, the SP1, SP2, SP3, SP4 or SP5 is 
isolated and/or purified and used in cleaning compositions such as detergents. WO 
95/10615 discloses detergent formulation. A serine protease of the present invention can be 
useful in formulating various cleaning compositions. A number of known compounds are 
suitable surfactants useful in compositions comprising the serine protease of the invention. 
These include nonionic, anionic, cationic or zwitterionic detergents, as disclosed in 
US 4,404,128 and US 4,261,868. A suitable detergent formulation is that described in 
Example 7 of US Patent 5,204,015. The art is familiar with the different formulations which 
can be used as cleaning compositions. In addition, a serine protease of the present 
invention can be used, for example, in bar or liquid soap applications, dishcare formulations, 
contact lens cleaning solutions or products, peptide hydrolysis, waste treatment, textile 
applications, as fusion-cleavage enzymes in protein production, etc. A serine protease of 
the present invention may provide enhanced performance in a detergent composition (as 
compared to another detergent protease). As used herein, enhanced performance in a 
detergent is defined as increasing cleaning of certain enzyme sensitive stains such as grass 
or blood, as determined by usual evaluation after a standard wash cycle. 

A serine protease of the present invention can be formulated into known powdered 
and liquid detergents having pH between 6.5 and 12.0 at levels of about 0.01% to about 5% 
(preferably 0.1% to 0.5%) by weight. These detergent cleaning compositions can also include 
other enzymes such as known proteases, amylases, cellulases, lipases or 
endoglycosidases, as well as builders and stabilizers. 
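
As a worked illustration of the weight percentages above, the following sketch converts an inclusion level into a mass of protease for a given batch. The function name and the batch figures are hypothetical; only the 0.01% to 5% range, the preferred 0.1% to 0.5% range, and the pH 6.5 to 12.0 window come from the text.

```python
# Ranges taken from the paragraph above.
PH_RANGE = (6.5, 12.0)
LEVEL_RANGE_PERCENT = (0.01, 5.0)
PREFERRED_RANGE_PERCENT = (0.1, 0.5)


def protease_mass_g(batch_mass_g, percent_by_weight, detergent_ph):
    """Mass of protease (g) for a detergent batch at the given weight percent.

    Raises ValueError when the level or the pH falls outside the stated ranges.
    """
    if not LEVEL_RANGE_PERCENT[0] <= percent_by_weight <= LEVEL_RANGE_PERCENT[1]:
        raise ValueError("inclusion level outside about 0.01% to about 5% by weight")
    if not PH_RANGE[0] <= detergent_ph <= PH_RANGE[1]:
        raise ValueError("detergent pH outside 6.5 to 12.0")
    return batch_mass_g * percent_by_weight / 100.0


def in_preferred_range(percent_by_weight):
    """True when the level lies within the preferred 0.1% to 0.5% window."""
    low, high = PREFERRED_RANGE_PERCENT
    return low <= percent_by_weight <= high
```

For a hypothetical 1000 g batch at 0.5% by weight and pH 9.0, `protease_mass_g(1000, 0.5, 9.0)` gives 5.0 g.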

The addition of a serine protease to conventional cleaning compositions does not 
create any special use limitation. In other words, any temperature and pH suitable for the 
detergent is also suitable for the present compositions as long as the pH is within the above 
range, and the temperature is below the described serine protease denaturing temperature. 
In addition, a serine protease of the present invention can be used in a cleaning 
composition without detergents, again either alone or in combination with builders and 
stabilizers. 

One aspect of the invention is a composition for the treatment of a textile that 
includes a serine protease of the present invention. The composition can be used to treat 
for example silk or wool as described in publications such as RD 216,034; EP 134,267; US 
4,533,359; and EP 344,259. 

Proteases can be included in animal feed such as part of animal feed additives as 
described in, for example, US 5,612,055; US 5,314,692; and US 5,147,642. 

Polynucleotides 

A B. subtilis SP1, SP2, SP3, SP4 or SP5 polynucleotide, or any part thereof, provides 
the basis for detecting the presence of gram-positive microorganism polynucleotide 
homologs through hybridization techniques and PCR technology. 

Accordingly, one aspect of the present invention is to provide for nucleic acid 
hybridization and PCR probes which can be used to detect polynucleotide sequences, 
including genomic and cDNA sequences, encoding gram-positive SP1, SP2, SP3, SP4 or 
SP5 or portions thereof. 

The manner and method of carrying out the present invention may be more fully 
understood by those of skill in the art by reference to the following examples, which 
examples are not intended in any manner to limit the scope of the present invention or of 
the claims directed thereto. 

Example I 
Preparation of a Genomic Library 

The following example illustrates the preparation of a Bacillus genomic library. 

Genomic DNA from Bacillus cells is prepared as taught in Current Protocols In 
Molecular Biology vol. 1, edited by Ausubel et al., John Wiley & Sons, Inc. 1987, chapter 
2.4.1. Generally, Bacillus cells from a saturated liquid culture are lysed and the proteins 
removed by digestion with proteinase K. Cell wall debris, polysaccharides, and remaining 
proteins are removed by selective precipitation with CTAB, and high molecular weight 
genomic DNA is recovered from the resulting supernatant by isopropanol precipitation. If 
exceptionally clean genomic DNA is desired, an additional step of purifying the Bacillus 
genomic DNA on a cesium chloride gradient is added. 

After obtaining purified genomic DNA, the DNA is subjected to Sau3A digestion. 
Sau3A recognizes the 4 base pair site GATC and generates fragments compatible with 
several convenient phage lambda and cosmid vectors. The DNA is subjected to partial 
digestion to increase the chance of obtaining random fragments. 
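
The Sau3A recognition step can be illustrated in silico. The sketch below is a simplified model with hypothetical function names: it treats the DNA as a linear top strand, assumes cleavage immediately 5' of the G in GATC, and models a complete digest, whereas the example above relies on partial digestion, in which only some sites are cut.

```python
SAU3A_SITE = "GATC"


def sau3a_site_positions(dna):
    """Return 0-based start positions of every GATC site on the given strand."""
    dna = dna.upper()
    positions = []
    start = dna.find(SAU3A_SITE)
    while start != -1:
        positions.append(start)
        start = dna.find(SAU3A_SITE, start + 1)
    return positions


def complete_digest(dna):
    """Cut a linear sequence before the G of each GATC and return the fragments."""
    dna = dna.upper()
    # A site at position 0 produces no cut because nothing lies upstream of it.
    cuts = [position for position in sau3a_site_positions(dna) if position > 0]
    bounds = [0] + cuts + [len(dna)]
    return [dna[left:right] for left, right in zip(bounds, bounds[1:])]
```

A partial digest would yield unions of adjacent fragments from this list, which is why it gives a more nearly random set of overlapping inserts.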

The partially digested Bacillus genomic DNA is subjected to size fractionation on a 
1% agarose gel prior to cloning into a vector. Alternatively, size fractionation on a sucrose 
gradient can be used. The genomic DNA obtained from the size fractionation step is 
purified away from the agarose and ligated into a cloning vector appropriate for use in a 
host cell and transformed into the host cell. 

Example II 

The following example describes the detection of gram-positive microorganism SP1. 
The same procedures can be used to detect SP2, SP3, SP4 or SP5. 

DNA derived from a gram-positive microorganism is prepared according to the 
methods disclosed in Current Protocols in Molecular Biology, Chap. 2 or 3. The nucleic acid 
is subjected to hybridization and/or PCR amplification with a probe or primer derived from 
SP1. A preferred probe comprises the nucleic acid section encoding conserved amino acid 
residues. 

The nucleic acid probe is labeled by combining 50 pmol of the nucleic acid and 250 
mCi of [gamma-32P] adenosine triphosphate (Amersham, Chicago IL) and T4 
polynucleotide kinase (DuPont NEN®, Boston MA). The labeled probe is purified with a 
Sephadex G-25 super fine resin column (Pharmacia). A portion containing 10^7 counts per 
minute of each is used in a typical membrane based hybridization analysis of nucleic acid 
sample of either genomic or cDNA origin. 

The DNA sample which has been subjected to restriction endonuclease digestion is 
fractionated on a 0.7 percent agarose gel and transferred to nylon membranes (Nytran Plus, 
Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40 degrees 
C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium 
dodecyl sulfate. The blots are exposed to film for several hours, the film developed and 
hybridization patterns are compared visually to detect polynucleotide homologs of B.subtilis 
SP1. The homologs are subjected to confirmatory nucleic acid sequencing. Methods for 
nucleic acid sequencing are well known in the art. Conventional enzymatic methods employ 
DNA polymerase Klenow fragment, SEQUENASE® (US Biochemical Corp, Cleveland, OH) 
or Taq polymerase to extend DNA chains from an oligonucleotide primer annealed to the 
DNA template of interest. 

Various other examples and modifications of the foregoing description and examples 
will be apparent to a person skilled in the art after reading the disclosure without departing 
from the spirit and scope of the invention, and it is intended that all such examples or 
modifications be included within the scope of the appended claims. All publications and 
patents referenced herein are hereby incorporated in their entirety. 

CLAIMS 

1. A gram-positive microorganism having a mutation or deletion of part or all of one or 
more of the genes encoding a serine protease selected from the group consisting of SP1, 
SP2, SP3, SP4 and SP5, said mutation or deletion resulting in the inactivation of the SP1, 
SP2, SP3, SP4 or SP5 proteolytic activity. 

2. The gram-positive microorganism according to Claim 1 that is a member of the 
family Bacillus. 

3. The microorganism according to Claim 2 wherein the member is selected from the 
group consisting of B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. 
alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus and Bacillus 
thuringiensis. 

4. The microorganism of Claim 1 wherein said microorganism is capable of 
expressing a heterologous protein. 

5. The microorganism of Claim 4 wherein said heterologous protein is selected from 
the group consisting of hormone, enzyme, growth factor and cytokine. 

6. The microorganism of Claim 5 wherein said heterologous protein is an enzyme. 

7. The microorganism of Claim 6 wherein said enzyme is selected from the group 
consisting of proteases, carbohydrases, and lipases; isomerases such as racemases, 
epimerases, tautomerases, or mutases; transferases, kinases and phosphatases. 

8. A cleaning composition comprising a serine protease selected from the group 
consisting of SP1, SP2, SP3, SP4 and SP5. 

9. An expression vector comprising nucleic acid encoding a serine protease selected 
from the group consisting of SP1, SP2, SP3, SP4 and SP5. 

10. A host cell comprising an expression vector according to Claim 9. 

11. A method for the production of a heterologous protein in a Bacillus host cell 
comprising the steps of 

(a) obtaining a Bacillus host cell comprising nucleic acid encoding said 
heterologous protein wherein said host cell contains a mutation or deletion in at least 
one of the genes encoding serine protease 1, serine protease 2, serine protease 3, 
serine protease 4 and serine protease 5; and 

(b) growing said Bacillus host cell under conditions suitable for the expression of 
said heterologous protein. 

The method of Claim 11 wherein said Bacillus cell is selected from the group 
consisting of Bacillus subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. 
alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus and Bacillus 
thuringiensis. 

l % 

JA. The method of Claim 13 wherein said Bacillus host cell further comprises a mutation 
or deletion in at least one of the genes encoding apr, npr, epr, wpr and mpr. 

>5. A gram-positive microorganism having a mutation or deletion in at least one of the 
genes encoding a serine protease selected from the group consisting of serine protease 1, 
serine protease 2, serine protease 3, serine protease 4 and serine protease 5. 

AG- The microorganism of Claim 16 further comprising a mutation or deletion in at least 
one of the genes encoding apr, npr, epr, wpr and mpr. 

Nucleotide sequence with deduced amino acid translation (the number at left is the 
position of the first nucleotide in each row) 

   1  atgaaaaagctgataaccgcagacgacatcacagcgattgtctctgtg 
      MKKLITADDITAIVSV 
  49  accgatcctcaatacgccccagacggtacccgtgccgcatatgtaaaa 
      TDPQYAPDGTRAAYVK 
  97  tcacaagtaaatcaagagaaagattcgtatacatcaaatatatggatc 
      SQVNQEKDSYTSNIWI 
 145  tatgaaacgaaaacgggaggatctgttccttggacacatggagaaaag 
      YETKTGGSVPWTHGEK 
 193  cgaagcaccgacccaagatggtctccggacgggcgcacgcttgccttt 
      RSTDPRWSPDGRTLAF 
 241  atttctgatcgagaaggcgatgcggcacagctttatatcatgagcact 
      ISDREGDAAQLYIMST 
 289  gaaggcggagaagcaagaaaactgactgatatcccatatggcgtgtca 
      EGGEARKLTDIPYGVS 
 337  aagccgctatggtccccggacggtgaatcgattctggtcactatcagt 
      KPLWSPDGESILVTIS 
 385  ttgggagagggggaaagcattgatgaccgagaaaaaacagagcaggac 
      LGEGESIDDREKTEQD 
 433  agctatgaacctgttgaagtgcaaggcctctcctacaaacgggacggc 
      SYEPVEVQGLSYKRDG 
 481  aaagggctgacgagaggtgcgtatgcccagcttgtgcttgtcagcgta 
      KGLTRGAYAQLVLVSV 
 529  aagtcgggtgagatgaaagagctgacaagtcacaaagctgatcatggt 
      KSGEMKELTSHKADHG 
 577  gatcctgctttttctcctgacggcaaatggcttgttttctcagctaat 
      DPAFSPDGKWLVFSAN 
 625  ttaactgaaacagatgatgccagcaagccgcatgatgtttacataatg 
      LTETDDASKPHDVYIM 
 673  tcactggagtctggagatcttaagcaggttacacctcatcgcggctca 
      SLESGDLKQVTPHRGS 
 721  ttcggatcaagctcattttcaccagacggaaggtatcttgctttgctt 
      FGSSSFSPDGRYLALL 
 769  ggaaatgaaaaggaatataagaatgctacgctctcaaaggcgtggctc 
      GNEKEYKNATLSKAWL 
 817  tatgatatcgaacaaggccgcctcacatgtcttactgagatgctggac 
      YDIEQGRLTCLTEMLD 
 865  gttcatttagcggatgcgctgattggagattcattgatcggtggtgct 
      VHLADALIGDSLIGGA 
 913  gaacagcgcccgatttggacaaaggacagccaagggttttatgtcatc 
      EQRPIWTKDSQGFYVI 
 961  ggcacagatcaaggcagtacgggcatctattatatttcgattgaaggc 
      GTDQGSTGIYYISIEG 
1009  cttgtgtatccgattcgtctggaaaaagagtacatcaatagcttttct 
      LVYPIRLEKEYINSFS 
1057  ctttcacctgatgaacagcactttattgccagtgtgacaaagccggac 
      LSPDEQHFIASVTKPD 

FIG-1B-1 

09/462845 
PCT/US98/14647 

1105  agaccgagtgagctttacagtatcccgcttggacaggaagagaaacag 
      RPSELYSIPLGQEEKQ 
1153  ctgactggcgcgaatgacaagtttgtcagggagcatacgatatcaata 
      LTGANDKFVREHTISI 
1201  cctgaagagattcaatatgctacagaagacggcgtgatggtgaacggc 
      PEEIQYATEDGVMVNG 
1249  tggctgatgaggcctgcacaaatggaaggtgagacaacatatccactt 
      WLMRPAQMEGETTYPL 
1297  attcttaacatacacggcggtccgcatatgatgtacggacatacatat 
      ILNIHGGPHMMYGHTY 
1345  tttcatgagtttcaggtgctggcggcgaaaggatacgcggtcgtttat 
      FHEFQVLAAKGYAVVY 
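
The amino acid line printed beneath each nucleotide row can be checked against the standard genetic code. The sketch below is an illustration with a hypothetical function name, not part of the filed figures; it assumes the standard code (NCBI translation table 1) and that each row is read in frame from its first nucleotide. The two test sequences are nucleotides 1 to 48 and 721 to 768 as they appear in the figure.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_row(dna):
    """Translate one figure row; whitespace left over from scanning is removed."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("row length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


ROW_1 = "atgaaaaagctgataaccgcagacgacatcacagcgattgtctctgtg"
ROW_721 = "t tcggatcaagctcattttcaccagacggaaggtatcttgctttgctt"

print(translate_row(ROW_1))    # MKKLITADDITAIVSV
print(translate_row(ROW_721))  # FGSSSFSPDGRYLALL
```

Both rows reproduce the translation shown in the figure, which supports reading the stray spaces in the scanned rows as extraction damage rather than sequence gaps.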
DECLARATION 
AND POWER OF ATTORNEY 



AS A BELOW NAMED INVENTOR, I HEREBY DECLARE THAT: 

MY RESIDENCE, POST OFFICE ADDRESS AND CITIZENSHIP ARE AS STATED BELOW NEXT TO MY NAME. I BELIEVE I AM 
THE ORIGINAL, FIRST AND SOLE INVENTOR (IF ONLY ONE NAME IS LISTED BELOW) OR AN ORIGINAL, FIRST AND JOINT 
INVENTOR (IF PLURAL NAMES ARE LISTED BELOW) OF THE SUBJECT MATTER WHICH IS CLAIMED AND FOR WHICH A 
PATENT IS SOUGHT ON THE INVENTION ENTITLED PROTEASES FROM GRAM-POSITIVE ORGANISMS THE 
SPECIFICATION OF WHICH 



CHECK ONE: 

XX IS ATTACHED HERETO 
__ WAS FILED ON _ AS APPLICATION SERIAL NO. _ AND WAS AMENDED ON _ 



I HEREBY STATE THAT I HAVE REVIEWED AND UNDERSTAND THE CONTENTS OF THE ABOVE IDENTIFIED 
SPECIFICATION, INCLUDING THE CLAIMS, AS AMENDED BY ANY AMENDMENT REFERRED TO ABOVE. I ACKNOWLEDGE 
THE DUTY TO DISCLOSE INFORMATION WHICH IS MATERIAL TO PATENTABILITY AS DEFINED IN TITLE 37, CODE OF 
FEDERAL REGULATIONS §1.56. 

I HEREBY CLAIM FOREIGN PRIORITY BENEFITS UNDER TITLE 35, UNITED STATES CODE §119, OF ANY FOREIGN 
APPLICATION(S) FOR PATENT OR INVENTOR'S CERTIFICATE LISTED BELOW AND HAVE ALSO IDENTIFIED BELOW ANY 
FOREIGN APPLICATION FOR PATENT OR INVENTOR'S CERTIFICATE HAVING A FILING DATE BEFORE THAT OF THE 
APPLICATION ON WHICH PRIORITY IS CLAIMED. 



APPLICATION NUMBER    DATE OF FILING    PRIORITY CLAIMED 



I HEREBY CLAIM THE BENEFIT UNDER TITLE 35, UNITED STATES CODE §120, OF ANY UNITED STATES 
APPLICATION(S) OR PCT INTERNATIONAL APPLICATION(S) DESIGNATING THE UNITED STATES OF AMERICA THAT IS 
LISTED BELOW AND, INSOFAR AS THE SUBJECT MATTER OF EACH OF THE CLAIMS OF THIS APPLICATION IS NOT 
DISCLOSED IN THE PRIOR UNITED STATES APPLICATION IN THE MANNER PROVIDED BY THE FIRST PARAGRAPH OF 
TITLE 35, UNITED STATES CODE §112, I ACKNOWLEDGE THE DUTY TO DISCLOSE MATERIAL INFORMATION AS 
DEFINED IN TITLE 37, CODE OF FEDERAL REGULATIONS §1.56(A) WHICH OCCURRED BETWEEN THE FILING DATE OF 
THE PRIOR APPLICATION AND THE NATIONAL OR PCT INTERNATIONAL FILING DATE OF THIS APPLICATION. 



APPLICATION NUMBER    DATE OF FILING    STATUS - PATENTED, PENDING OR ABANDONED 
PCT/US98/14647        14 JULY 1998      Pending 
EP 97305232.7         15 JULY 1997      Abandoned 



POWER OF ATTORNEY: AS A NAMED INVENTOR I HEREBY APPOINT AS MY ATTORNEY(S) WITH FULL POWER OF 
SUBSTITUTION AND REVOCATION, TO PROSECUTE THIS APPLICATION AND TRANSACT ALL BUSINESS IN THE PATENT 
AND TRADEMARK OFFICE CONNECTED THEREWITH: 






MARGARET A. HORN, REG. NO. 33,401; 
CHRISTOPHER L. STONE, REG. NO. 35,696 

JefferyD. Frazier, Reg. Nq_3A60J 
Susan Faris, Reg. No. 41,739 



SEND CORRESPONDENCE TO: 


DIRECT TELEPHONE CALLS TO: 


Jeffery D. Frazier 


650 846-7615 


Genencor International, Inc. 


925 Page Mill Road 




Palo Alto, California 94304 

FULL NAME OF INVENTOR 


FULL FIRST NAME 

David 


INITIAL LAST NAME 

ESTELL 


RESIDENCE & CITIZENSHIP 

CITY         STATE OR FOREIGN COUNTRY    COUNTRY OF CITIZENSHIP 
San Mateo    California                  USA 


POST OFFICE ADDRESS 

248 WOODBRIDGE CIRCLE, 


CITY STATE OR COUNTRY ZIP CODE 
SAN MATEO CALIFORNIA 94403 



I FURTHER DECLARE THAT ALL STATEMENTS MADE HEREIN OF MY OWN KNOWLEDGE ARE TRUE AND 
THAT ALL STATEMENTS MADE ON INFORMATION AND BELIEF ARE BELIEVED TO BE TRUE; AND FURTHER 
THAT THESE STATEMENTS WERE MADE WITH THE KNOWLEDGE THAT WILLFUL FALSE STATEMENTS 
AND THE LIKE SO MADE ARE PUNISHABLE BY FINE OR IMPRISONMENT, OR BOTH, UNDER SECTION 
1001 OF TITLE 18 OF THE UNITED STATES CODE, AND THAT SUCH WILLFUL FALSE STATEMENTS MAY 
JEOPARDIZE THE VALIDITY OF THE APPLICATION OR ANY PATENT ISSUING THEREON. 